Reconstructive and Discriminative Sparse Representation for Visual Object Categorization
نویسندگان
چکیده
Generic Visual Object Categorization (VOC) aims at predicting whether at least one or several objects of some given categories are present in an image. In fact, VOC is a fundamental problem in computer vision and pattern recognition, and has become an important research topic due to the wide range of possible applications such as video monitoring, video coding systems, security access control, automobile driving support as well as automatic image and video indexation and retrieval [4]. Until now, many VOC methods have been proposed and applied to the classification of numerous objects categories like, for example, cars, motorbikes, animals, people, furniture etc. Despite many efforts and much progress that have been made during the past years, it remains an open problem and is still considered as one of the most challenging topics in computer vision [2]. In particular, the image representation is a key problem since, from the image visual content presented in the form of image features, it has to be able to model effectively this content in a discriminative way to allow an efficient classification of the image. In this paper, we propose to adapt the principles of sparse representation theory to the problem of VOC. Thus, we have elaborated a reconstructive and discriminative sparse representation of images, which incorporates a discriminative term, such as Fisher discriminative measure or the output of a SVM classifier, into the standard sparse representation objective function in order to learn a reconstructive and discriminative dictionary. Let consider a set of N training signals {yi}i=1 belonging to M categories. Y = [y1,y2, ...,yN ] is a signal matrix with the corresponding sparse coefficients based on the dictionary D as X = [x1,x2, ...,xN ]. Moreover, we suppose that Ni signals are in the category Mi, for 1≤ i≤M. The objective function of the standard reconstructive sparse representation can be expressed as:
منابع مشابه
A Unified Semantic Embedding: Relating Taxonomies and Attributes
We propose a method that learns a discriminative yet semantic space for object categorization, where we also embed auxiliary semantic entities such as supercategories and attributes. Contrary to prior work, which only utilized them as side information, we explicitly embed these semantic entities into the same space where we embed categories, which enables us to represent a category as their lin...
متن کاملSparse Dictionaries for Semantic Segmentation
A popular trend in semantic segmentation is to use top-down object information to improve bottom-up segmentation. For instance, the classification scores of the Bag of Features (BoF) model for image classification have been used to build a top-down categorization cost in a Conditional Random Field (CRF) model for semantic segmentation. Recent work shows that discriminative sparse dictionary lea...
متن کاملAdding Discriminative Power to Hierarchical Compositional Models for Object Class Detection
In recent years, hierarchical compositional models have been shown to possess many appealing properties for the object class detection such as coping with potentially large number of object categories. The reason is that they encode categories by hierarchical vocabularies of parts which are shared among the categories. On the downside, the sharing and purely reconstructive nature causes problem...
متن کاملSoft Measure of Visual Token Occurrences for Object Categorization
The improvement of bag-of-features image representation by statistical modeling of visual tokens has recently gained attention in the field of object categorization. This paper proposes a soft bag-of-features image representation based on Gaussian Mixture Modeling (GMM) of visual tokens for object categorization. The distribution of local features from each visual token is assumed as the GMM an...
متن کاملSparse Discrimination based Multiset Canonical Correlation Analysis for Multi-Feature Fusion and Recognition
Multiset canonical correlation analysis is a powerful technique for analyzing linear correlations among multiple representation data. However, it usually fails to discover the intrinsic sparse reconstructive relationship and discriminating structure of multiple data spaces in real-world applications. In this paper, by taking discriminative information of within-class and between-class sparse re...
متن کامل